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Abstract 

Over the years numerous models of SIS (susceptible infected susceptible) disease dynamics unfolding on 
networks have been proposed. Here, we discuss the links between many of these models and how they can be viewed 
as more general motif-based models. We illustrate how the different models can be derived from one another and, 
where this is not possible, discuss extensions to established models that enables this derivation. We also derive a 
general result for the exact differential equations for the expected number of an arbitrary motif directly from the 
Kolmogorov/master equations and conclude with a comparison of the performance of the different closed systems of 
equations on networks of varying structure. 

1 Introduction 

Modeling the spread of infectious diseases requires an understanding of not only disease characteristics but also an 
understanding of the community (be it a hospital, school, town, etc) in which it pervades. An important consideration 
in modelling the spread of diseases is thus the contact structure on which disease transmission happens. Whereas 
traditional approaches (|l2]|5]) assume little or no topological structure, recent work ([T3] [14] [T5l ) has tried to incor- 
portate the underlying linkages between entities in the population and study how these links facilitate the spread of 
the disease. For a continuous-time stochastic disease transmission model on an arbitrary network it is possible ( IfTTl ), 
to write down the relevant Kolmogorov/master equations and thus model it as a continuous time Markov chain that 
fully describes the movement between all possible system states. Unfortunately the complexity of the model comes 
from the size of the state space and the number of equations scales exponentially as a^, where a is the number of 
different states a node can be in and is the network size. One widely used resolution to this complexity is to create 
individual-based simulation models and investigate the system behaviour directly. Even though increasing computa- 
tional power makes simulations an increasingly attractive proposition they lack analytic tractability. Whilst this is not 
always a hindrance, when the system displays a rich range of behaviour (e.g. oscillations, bistability) it may not be 
feasable to obtain a global overview of the effects of different parameter values and thus the more analytic approach is 
needed. For this reason, low-dimensional systems of differential equations ( lfT3l l6l [T5]| ) are sought provided that these 
can approximate the exact solution. By reducing the problem to a smaller system of equations it is easier to study the 
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bifurcation structure of the model and gain a greater understanding of the full spectrum of behaviour The challenge is 
then finding the set of equations that best approximate the solution of the Kolmogorov equations. 

Given that here we focus on epidemic models, usually such models are formulated in terms of the expected values 
of the number of infected and/or susceptible individuals or some other motif in the network such as the expected 
number of infected and/or susceptible individuals of different degrees (the number of connections a node has). Such 
models range from classic meanfield [1] to pairwise [13], heterogenous pairwise [6|, effective degree |]T5][T6| and 
individual-level models ll20ll to name a few. Whilst these models seem to use different approaches their derivation is 
based on the same conceptual framework, namely they begin by choosing a base-motif (e.g. a node, a link and the two 
nodes it connects, a node and all its links). These base-motifs are then used to formulate equations for the different 
possible states that they can achieve (e.g. for the expected number of motifs in different states or the probability that a 
specified motif in the network is in a certain state). These equations generally involve not only the base-motif itself, 
but larger or extended motifs of which they are usually part of. These larger motifs in turn depend on more complex 
motifs and a closure is needed in order to obtain a self-contained system of equations of reasonable size. Importantly 
the base-motif determines not only the complexity of the model (the larger the motif the greater the number of states 
it can be in) but also how much of the network topology can be captured. Interestingly differential equations for 
smaller motifs that are part of the base-motif should, in theory, be recoverable from the original differential equation. 
To this end the main focus of the paper is the consideration of various simple models of disease dynamics and the 
relations between them. We also consider which models are derivable directly (subject to a suitable closure) from the 
Kolmogorov/master equations and can thus be referred to as exact. 

We begin in section|2]with an introduction of some of the more common approaches to modelling disease dynamics 
on networks, considering meanfield (| 1 1), pairwise (|13 1), heterogeneous pairwise (|6|) and the effective degree ([15 1) 
model formulations. In section |3] we formulate an exact version of the effective degree model and then illustrate 
how the pairwise model can then be recovered from this new set of equations. We are, however, unable to recover 
the heterogenous pairwise model from the exact effective degree and this motivates, in section |4] an extension of 
this which incorporates further network topology into the ODEs. From this extension we then show how it is then 
possible to recover the heterogeneous pairwise equations. Once the links between the models have been established, 
in section|5]we show how the unclosed version of the models can be derived directly from the Kolmogorov equations. 
This is done by proving that as long as the heuristic equations for any motif are written following a certain set of rules 
they will always be exact. We conclude, in section |6] with a brief comparison of the models and discuss under what 
circumstances they perform best, in the sense of being close to simulation results. 

2 Models of disease dynamics 

In this paper we focus on susceptible infected — susceptible (SIS) disease dynamics on networks but note that 
aU of the following models can be adapted for other disease (e.g. SIR and/or contact tracing) or non-disease (e.g. 
evolutionary fSl) dynamics. With this in mind we use T as the per-link transmission rate between susceptible and 
infected nodes and y as the recovery rate of an infected individual. Both infection and recovery are modelled as 
independent poisson processes. As a starting point we give a short summary of ODE-based models that are either exact 
or an approximation of the true dynamics resulting from the full system based on the Kolmogorov/master equations, 
where these are solvable, or based on simulation. 

2.1 Pairwise and the resulting simple compartmental model 

In order to focus on the underlying network of contacts, we introduce the pairwise model first ( llT3l[T9l ). The main idea 
of this model is to develop the hierarchical dependence of lower order moments (e.g. expected number of susceptible 
[S] and infected [/] nodes) on higher ones (e.g. expected number of pairs with one susceptible and one infected node, 
[SI]) and to derive appropriate models that correctly account for these. As already suggested, the expected number of 
pairs will depend on larger motifs, in this case these being the expected number of triples denoted by [ABC], where 
A,B,C e {S,I} and B is connected to A and C. Using this notation the equations governing the evolution of the disease 
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dynamics at the level of singles and pairs are given by 




2t{[ISI] + [IS])-2y[II]. 



T{[iss]-[isi]-[is])+rm-m, 



2t[ISS]+2Y[IS], 



(4) 



(2) 



(3) 



(1) 



Importantly we note that these equations are unclosed as no equations are given for the evolution of the triples. The 
standard closure (in the absence of clustering) makes the assumption that the status of pairs are statistically independent 
of one another and then 



where n is the average degree of the network. When we use this closure we say we have closed "at the level of triples" 
. In order to derive the classic mean-field model a closure at the level of paris can be applied, namely, [SI] can be 
approximated as 



and upon using Eq. ([U, the classic mean-field model can be recovered 



where the widely used transmission rate from the compartmental model, |[T1, is /3 = Tn. 

It is also important to note that the unclosed equations above (Eqs. ([T]|4|i) can be derived directly from the state- 
based Kolmogorov equations and for this reason we refer to these equations as exact. Whilst a proof for the exactness 
of these equations was given in ||2TI . in section|5]we provide a more general proof that allows us to write down exact 
equations for, not just pairs, but any motif structure. We also note that an alternative approach was used by Sharkey 
in 1 20 1, to prove that the standard pairwise equations were exact for models with susceptible — ;> infected — ^ recovered 
(SIR) disease dynamics. 

2.2 Heterogeneous pairwise model 

Whilst the pairwise equations perform well in capturing disease dynamics on networks that are well described by 
their average degree, the closure assumption fails when greater heterogeneity is introduced. More precisely, whilst 
the pairwise equations above are exact for an arbitrary network before a closure, these do not guarantee that with the 
current choice of singles and pairs (i.e. [S] could be further divided to account for heterogeneity in degree) a valid 
closure could be found for any network. Indeed, to account for greater heterogeneity Eames etal. 1 6 1 further developed 
the pairwise model by taking into account not just the state of nodes and pairs but also the degrees of the nodes. By 
using [A"] to represent expected number of nodes of type A with degree n and with similar notation for pairs and 



[ABC] «[AB](n-l) 



[BC] 



n[B] 




(5) 
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triples, they were able to formulate the following set of unclosed equations 

^.[5«]=7[/"]-T^[5«/?], (6) 



dt 

d_ 

dt 



[I-]^-Y[r] + xY,[S"I% (7) 



^ = - T ^ ( [5«5"7'?] + ) + 7 ( [5"/'"] + [I"S"'] ) , (8) 

^ [s"!'"] = T ^ ( - [r's"r] ) - T [s"!'"] - r [s"r] + r [/"/'"] , (9) 

^ [/"/'"] = + [IIS"!'"]) + T [rr'] + T [5"/'"] - 27[/"r] . (10) 

1 

Again assuming the statistical independence of pairs and absence of clustering, Eames et. al, f6\, suggest the following 
approximations of triples 



\B"C"'DP] w \B"C'"] (m - 1 ) 

m[C" 



2.3 The effective degree model 

In ifTSi . Lindquist et al. introduced the effective degree model for SIS (and also SIR) dynamics on a network (an 
equivalent model formulation was also proposed by Marceau et al. |16|). In this model they consider not only the 
state of a node (S or /), but also the number of the immediate neighbours in the various potential states. This is done 
by writing the following set of equations for all the possible star-like motifs in the network where Ss.i (Isj) represents 
the expected number of susceptible (infected) nodes with s susceptible and / infected neighbours, 

S],i = - TiSsj + Yh.i + 7[(' + - iSs.i] 

— M ^ + Ij'^i+i,;-! -■S'^i,/], (11) 



J2k=lJ2j+l=kJ^.iJ 

Is J =xiSs.i - Yls.i + 7[(' + 1)^-1, ;+i - il.'^A 

+ t: M ^ —[{s+l)ls+i.i-i-sls,il (12) 

with 1 < s + i <M, where M is the maximum degree and the equations are suitably adjusted on the boundaries. It is 
important to note that this model is not exact as a closure has been already applied. Namely the infection of a node's 
susceptible neighbours is based on a population-level approximation. To illustrate this more precisely we borrow the 
notation of the pairwise model and make two observations 

T.tii:j+i=knsjj _ [iss] 

j:f=ij:j+,=ki'sj., ^ i:txT.j+,=kKi-^)Sjj+iSj,i ^ m+isi] ^ m 
Et^Ej+t^kJ^jj Y.t^i:j+i.kjij.> [^^] [^^] ■ 

These means that the infection pressure on the susceptible neighbours of the central node is equal to the population 
level average taken from all the possible star-like configurations rather then from the extended star structures that 
would account exactly for these infections. 
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3 Recovering the pairwise model from the effective degree 



Whilst the pairwise and effective degree models seem different they are based on a similar approach. Both models 

work on approximating the evolution of different motifs in the network; individuals and links in the pairwise model 

and star-like structures in the effective degree. For both models, but more clearly for the pairwise, the models begin 

with a starting or base motif (e.g. nodes) for which an evolution equation is required. This will of course depend on 

an extended motif, typically the base motif extended by the addition of an extra node (e.g. pairs). This dependency on 

higher order motifs continues, for example, with pairs depending on triples, and then triples depending on quadruplets 

I 1 

(four nodes connected by a line, i.e. A — B — C — D, or a star with a centre and three spokes, i.e. A — B — C D). Hence, 
the models only differ in the choice of the base motif and then potentially in the way in which the systems are closed 
to curtail the dependency on higher order motifs. Since, here we are mainly interested in exact models, that is before 
a closure is applied, we begin by conjecturing an exact version of the effective degree model and show how starting 
from this the exact pairwise model can be derived. 

3.1 Exact effective degree 

Based on the ideas presented above, we extend the star-like base motif to reveal the dependence on higher order motifs 
and conjecture that this unclosed version of the effective degree model is exact. We begin by introducing a variable 
to count the expected number of infecteds connected to a node's susceptible neighbours. This is done by introducing 
two new terms, ,/] and ,/]. For the term ,/] (and similarly for [755^/ ,/]) the S in the middle is actually 
used to represent the susceptible neighbours of the central / from the motif with composition T,/,/ (i.e. the / node with 
neighbourhood (i',/') is the centre of the star, while 5 is a susceptible spoke). The / (on the left-hand side), in turn, 
represents the infective neighbours of these susceptibles' and within this count, in the case of ,/], we also include 
the originating central /. The exact effective degree model can then be written as 



Fig.[T]shows the possible transitions captured by this model. 

3.2 Recovering the pairwise equations 

The star-like composition of the effective degree model allows us to recover the pairwise equations via careful sum- 
mations. The full derivation of the pairwise model is given in Appendix 1, whilst here we only illustrate the derivation 
of the individuals (trivial but given for completeness) and the [//] pairs. 



- xiS,,i + yh,i + y[{i + 1 - iS,,,] 

+ t[/55,+i,,-_i]-t[/55,,,], 



(13) 



; - Yh.i + 7[(' + lKv-l,i+l - 

+ t[/5/,+i,,_i]-t[/5/,,,-]. 



(14) 




S.l 




where most terms from the original effective degree equations cancel and we have used that J^si^'^s.i = [SI] and 
, = [/]. For [//] the following equahty holds 
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Figure 1: Illustration of the transitions into and out of the 82^ class. Susceptible nodes are given in blue and infective 
nodes in red. Transitions into the class are shown in grey and transitions out in green. The corresponding terms of the 
general equation are also given. 
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s,i 

+ y[ni] - yY i{i - l)Is,i - rJ2 

+ xJ2 - WSIs+i,i-i] + [/57,+i,,-_i] - T^i[/57,,,-] 
=t:[ISI] + t[IS] - 7[77] + 7[777] - 7[777] - 7[77] + t[757] +t[IS] 
=2t{[ISI] + [IS])-2y[II], 

where we have used that E,,./7,,- = [77], E5,i(j- = Ei[tSIs,i] and that ^ [757,+i,,_i] = [757] + [57]. 

These all follow from the definition of the pairwise model and the definition of the new extended motifs from the exact 
effective degree model. We note that this result does indeed correspond to that of the given pairwise model. 



4 Higher order models 

Whilst we can recover the pairwise equations from the exact effective degree model we note that the same is not 
possible with the heterogeneous pairwise equations. This motivates an extension of the effective degree model where 
the degrees of neighbouring nodes are also taken in to account. Again we conjecture that this model can, in theory, be 
derived from the exact Kolmogorov equations and thus refer to it as exact. 



4.1 Exact effective degree with neighbourhood composition 

We extend the exact effective degree model to include the number of neighbours of the central nodes' neighbours. We 
begin by defining the following notation 

i' = {kJi, ■ ■ ■ ,'m), 



■-SI+S2' 



-sm, 



\i I =h+i2 + --- + iM, 

where sj (ij) represents the number of susceptible (infective) neighbours of degree j. We now define ^j/,/, (Ty/,/) as the 
number of susceptible (infective) nodes with neighbouring nodes whose own degrees are given by the entries in s' and 
i'. We can now write the extended ODEs in the following form 

M 



Ss/i' = - T:\i'\Ss,'i' + Yls',i' + yYj (4 + - r\^\^s',i' 



k=l 



M 



k=l 



(15) 



M 



Is'/ =T:\i'\Ss',i' - Yh',i' + yJ2 (4 + - y\^'\^s',i> 

M 



(16) 



k=l 



1 



Here s[_ ^ {si,S2, ■ ■ ■ ,Sk — 1, . . .,sm) and s'j^^ — {si,S2,---,Sk + l,--- ,sm) with a similar definition for and i'j^^. 

are taken to represent 



With a small modification to the exact effective degree notation terms such as 
number of infectious contacts of the susceptible neighbours of degree k. 

4.2 Model recovery 

Here we show how, from the extended effective degree model, we can recover the heterogenous pairwise model. It 
is also straightforward to show, and thus omitted here, that the extended effective degree leads to the simpler exact 
effective degree. In turn, it also follows easily that both the exact effective degree and heterogenous pairwise models 
reduce to the standard pairwise model. This hierarchy of recovery is illustrated in Fig.|2] 



4.2.1 Recovering the heterogeneous pairwise model from the extended effective degree 

As earlier we make use of careful summation to recover the model. The full derivation is provided in Appendix 2 so 
here we just provide the derivation at the individual level and of the [/'/"] pairs. For singles the following identities 
hold, 

^ s,,,^r[i"]~T[s"i], 

\s'\ + \i'\=n 
|s'| + |i'|=n 

where most terms from the original effective degree cancel and we have used that 

J2 Is'.i' = [n and \i'\S.'..' = [S"I]. 

|s'| + |i'|=« W\ + \i'\=n 
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For the [/ /"] pair we obtain 



d 
dt 



I'l" 



\s'\-\-\i'\=n 

=T ^ /; I i'\s,,j, ~ yY. 'i^^'-i' + ^E 'i E (''k + 1 

k=l 

M 



M 



k=\ 



=Tj2i', (|/'| - 1) S,,, + rY^i'iSs'/ - 7 [/'/"] 

+ 7 [/'/"/] - rE '/ - 1) - ^E ''^A'-' 
+-E'lEK.4-]+-E(';-i)K,.;_ 

27 [/'/"] 



+ r [/'/"/ 
=t|/'5'7] +t 



I'S" 



l'l"l 



IS'l" 



S'l" 



:J:{[i'S"1 



I'S" 



I'r 



-27 

/"S"/"' ] ) + T 



75'/" 



5'/" 



27 



jljn 



1 

Again, we note that this resuh corresponds to previously given heterogenous pairwise model. 

5 Exactness of the models 

In the previous sections we have at times referred to a set of ODEs as being exact. This terminology implies that the 
ODEs can be derived directly from the Kolmogorov equations which describe the evolution of the epidemic through 
the full state space S (on a network of size N, S — {S,I}^). In OTI the exactness of the pairwise equations was 
rigorously proven but no other motif structures were considered. In section [STl we conjectured that the newly defined 
exact effective degree model is derivable from the Kolmogorov equations. Due to the structure of the motifs used in 
the effective degree model a mechanistic proof (as in [21]) may be difficult and intricate to implement. Instead we 
will prove that a heuristic formulation of the ODEs for any motif structure is indeed exact providing they are written 
following rigorous bookkeeping This derivation of the evolution equations for an arbitrary motif, directly from the 
Kolmogorv equations, will be based on an extension of ideas presented in [11] and 121] and using the notation defined 
in Table □ 

We should note that in what follows a motif of connected nodes will only ever be counted once. In a network of 
size and in a motif, m, with k nodes this singular counting can be understood in the following way. We consider 
each of the (^) unique sets of k nodes between 1 and A^. Then for each set whose nodes are isomorphic in topological 
structure and status to the motif m, we simply increase the counter of such motifs by one. This formalism is unlike that 
used in the standard pairwise model where an SS link would contribute a value of two to the [SS] count. However, the 
two resultant sets of equations are equivalent in the sense that the different ways of counting can easily be recovered 
by using a simple mapping between the two. For this reason, whilst we prove that the following theorem is correct, 
it's intricacy and generality means a certain amount of care is needed when interpreting the resultant terms. Table |2] 
summarises the additional notation needed for the newly introduced motif approach. The result for a general motif is 
given in the following theorem. 
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(a) Extended effective degree 




Figure 2: Illustration of the hierarchial structure of model recovery. Links that are known are given by lines and 
knowledge of a nodes status is given by circles. The upper level (a) represents the extended effective degree ODEs. 
The status of the central node is known along with that of it's neighbours and also their degrees. The secondary level is 
given by (b), the effective degree model where there is no knowledge of neighbours' degrees and (c), the heterogenous 
pairwise model where the number of pairs of nodes and their relative degree is know. The final level shown, (d), is 
known as the standard pairwise model, ifTsl . where the status of individual nodes and pairs is used. 

Theorem 1 

The equation for the expected number (|A^|) of motifs of type m, given by 

+ YN\m+,m)~Y\M\N'{m) (17) 
is derivable directly from the exact Kolmogorov equations. 

5.1 Proof of Theorem 1 

For a detailed description of writing the Kolmogorov equations for an arbitrary graph we refer the reader to [TT'. Here 
we only provide a brief description making use of the notation defined in Table [T] to allow us to illustrate the proof. 
Setting X = . . . ,X^Y , the epidemic evolution through the state space is given by 

X = PX, (18) 

where 



( fiO 






















c' 
















A2 




















A3 




C3 
























V 











A^ 




) 
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Table 1: Notation for matrix representation of the Kolmogorov equations (Table from f2T|). 



Variable 


Definition 




Number of nodes in the network 


G=(g„)e{0,l}'v\/,; = L2,., 


. . Adjacency matrix with gij = 1 if nodes / and j are connected and 
gij = otherwise. The network is bi-directional and has no self 

cn/^Vi tlra f — on/H {~^-- — V\ W i 
lUUUa aLH^ll LilaL \J — \J allU \Jii — - W, V t. 


T 


Rate of infection per (5,/) edge. 


r 


Rate of recovery. 


c _ re T\i^ 


State space of the network, with nodes either susceptible, or 
infected, / and \S\ = 2^. 


o — -[Oi ,02, . . . ,0^^! 


The Cj^ = (^) states with k infected individuals in all possible 
configurations, with ^ = 0, 1 , . . . , A^. 




Probability of being in state iSj' at time f , where A: = 0, 1 , . . . , 
and; = 1,2,...,q. 








Rate of transition from 5^ ' to S\, where A: = 0, 1, . . . ,A^, ; = 
1,2, .. . ,Ci. and j — 1,2, .. . ,Ci._i . Note that only one individual 
is changing (i.e. in this case an S node changes to an / through 
infection). 




Rate of transition from 5*^' to iS^, where A = 0, 1, . . . ,A^, / = 
1 , 2, . . . , Q and y = 1 , 2, . . . , q+i . Note that only one individual 
is changing (i.e. in this case an / node changes to an S through 
infection). 




Rate of transition out of <S^, where ; = if / 7^ /' with k = 
0,l,...,A^and/,; = 1,2,...,q. 
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Table 2: Additional notation for matrix representation of the Kolmogorov equations 



Variable Definition 



m An arbitrary motif encompassing both topology and status of nodes (e.g. an S — I 

edge or a star like structure such as 73 o). The arbitrary motif we are consdering 
which will encompass bolh lopology and slalus of nodes. 

m"*" Represents the different motifs with the same structure as m but with a susceptible 

node of m having become infected. 

m~ Represents the different motifs with the same structure as m but with with an in- 

fective node of m having become susceptible. 

Mi^j Set of m motifs in configuration state Sj. Defining the i"' element of M^j as m[ j 

gives Mkj = {ml J, ml J, . . . .m' j'}- 

M^j The set of motifs, in configuration state Sj, with the same topology as m but with 1 

more infective and 1 less susceptible. Defining the i"' element of M^j as m'^j gives 

Mj^j The set of motifs, in configuration state <S*, with the same topology as m but with 

1 less infective and 1 more susceptible. Defining the j'"" element of M^j as m^"- we 

have M^j = {m^ . , m,^ . m^j ' }. 

NfiiiS'j) Number of m motifs in state <S^, with k = Q,\,...,N and 7 = 1,2,... ,0^. 

^il i^) Number of 5/ links within the motif h. 

Nf^{h) Expected total number of SI links within all motifs of type h 

Nf^{h,k) Number of SI links within the motif h, along which, were an infection to occur, 
would result in a motif of type k. 

Mf^ih, k) Expected total number of SI Unks within all motifs of type h, along which, were an 
infection to occur, would result in a motif of type k. 

^ex W Number of SI hnks where the 5 is contained within the motif h and the / is external 
to it. 

■Afg'ih) Expected total number of SI Unks to all motifs with structure h, where the S is 
contained within the motif h and the I external to it. 

N^{h,k) Number of SI links where the S is contained within the motif h and the I is external 
to it, along which, were an infection to occur, would result in a motif of type k. 

J\fg^{h,k) Expected total number of SI links to all motifs with structure h, where the S is 
contained within the motif h and the I external to it, along which, were an infection 
to occur, would result in a motif of type k. 

N' (h) Number of / nodes within motif h. 

N' {h, k) Number of / nodes within motif h, whose recovery lead to a motif of type k. 

{h,k) Expected total number of Is within motifs of type h, whose recovery lead to a motif 
of type k. 
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We now have the following equations for the state space probabilities. 





^b'x^^ 


^C^X\ 






+ bV + cV+^ foTk^l...{N-l 


x" 







From LIU , we also know that the entries of the matrix B are zero except on the diagonals, where we find that 



^5y = -E4r-E^7' 



=1 



=1 



= -TNsi{s'j)-kr. 



(19) 



Where fTl | focussed on individual and edge motifs here we focus on the derivation of evolution equations for the 
expected number of and arbitrary motif, m. We begin by writing the exact equations for an arbitrary motif m based on 
the transition and recovery matrices. This yields. 



\M\^Y.^,f,{S')x' 

k=0 

=Na{s')[b''x°+c°x'] 

N-l 



aV-^+bV + c''x''+^ 



k=\ 



N N 

= ^ N,^, (5^-)A*^X*- ' + ^ A^rf, {S'')bV + ^ N,n {S'')CV+^ 



N-l 



k=l 



k=Q k=0 
N N 



= ^ N,;, {S''+^)A''+^x'' + J2 N,h iS'')BV + J2 N,n {S''- ' )C'^" ' X 

k=0 k=0 k=l 

= [N,,{S')A'+N^iS')B'>]X^ 

Af-l 

+ Y, [NdS'^')A'+' +N„y{S')B'+N,f,{S'-')C 

k=\ 

Before continuing we note the following 



X' 



B''=Bl, 



b'^ = b% = - 



!=1 

N 

^A,!,=-TA^,K5?)=0. 



(20) 



!=1 



Taking these and (fT9] l into account and using the fact that B is only none zero on it's diagonal, we then obtain the 
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following equation, 

\M\ =N^{S')A'X'^ 



N-l 



k=l 



N-l 



(21) 



k=l 



k=l 



We note that the term containing X*^ vanishes because is a column vector with all zero entries. We now consider 
the summations involving the A and C matrices: 



(=1 

=riT |^A'rij(>S'y) + (number of m gained by node 1 becoming infected ] 
— (number of m lost by node 1 becoming infected )] 



+r2X 



N^i^'j) + (number of m gained by node 2 becoming infected ] 



— (number of m lost by node 2 becoming infected )] 



+ ... 



NmiS'j) + (number of m gained by node {N — k) becoming infected ) 



— (number of m lost by node (A'^ — k) becoming infected )] 

=ri T Nm (S'j) + (number of elements of M^j where node 1 is susceptible 

and where node I's infection would lead to a motif of type m) 

— (number of elements of Mj^j where node 1 is susceptible )] 

+r2X N,fi{S'j) + (number of elements of M^j where node 2 is susceptible 

and where node 2's infection would lead to a motif of type m) 

— (number of elements of Mj^j where node 2 is susceptible )] 
+ ... 

+rjv-ftT N,f,{S'j) + (number of elements of M^j where node (A'^ — k) is susceptible 

and where node (A'^ — kYs infection would lead to a motif of type m) 

— (number of elements of M^j where node (A'^ — k) is susceptible )] , 



grouping the terms we obtain, 

N^{S'^+')A'+']=xNs,iS'^j)N^iS'^j) + xJ2 [Nf(ml--,m)+^f (4j>'^) 
■' i=i 
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Similarly, 

(=1 

=7 |^^m('5'*) + (number of m gained by node {N — k+\) recovering ] 



— (number of m lost by node (A^ — ^ + 1 ) recovering )] 



+7 



+ ... 



+7 



Nm {Sj) + (number of m gained by node {N — k + 2) recovering 
(number of m lost by node {N — k + 2) recovering )] 

NmiS'j) + (number of m gained by node (A^) recovering ) 



— (number of m lost by node (A^) recovering )] 

=7 Nm (5y) + (number of elements of M^^. where node (A^ — + 1 ) is infective 

and where node {N — k+\)' s recovery would lead to a motif of type 

— (number of elements of Mj^ j of which node (A'^ — A: + 1 ) belongs )] 



+7 



Nm (5y ) + (number of elements of M^. where node (A' — A + 2) is infective 
and where node {N — k + 2)'s recovery lead to a motif of type m) 
- (number of elements of My^^j of which node (A'' — A: + 2) belongs )] 



+ ... 



+7 Afe('S'*) + (number of elements of M'^j where node (A'^) is infective 

and where node N's recovery would lead to a motif of type m) 
— (number of elements of Mj^ j of which node (A'^) belongs )] , 



grouping the terms we obtain 

'Arrf,(5*-i)C*-il . =YkN^{S)) + 7 ^ N\m'+^,m) - y\Mkj\ {n' {m)) . 



Defining 



M, 



— ^ 

(=1 



A)+' = x^[Nf^ {mi-j,m) + ATf (m^-,m)] - T|M,,,-|iV«(m) - T ^ [iVf 



i=l 
\M, 



(=1 



C*-i =yJ2 - {N'{m)) 

(=1 
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and setting A''+^ = [A\+\a)+\. . . ,AJ+1] and C*^-' = [Cf-^C)-^ . . .,C'^-\] yields, 

N-l N 



k=l k=l 
N-l 

N 

Y,[rkN^iS')-{kN„-,{S')+C'- 



k=l 
N-l 



k=l 

N-l CI, N Q 

k=l J=l k=l ;=1 



N-l ct 

i:=l /=1 ,= 1 ,= 1 



*:=! ;=1 I (=1 
N ct 



EE 

k=l j=l 



^E [N'{m'+^,n^)]-y\M,,\{N'{m)) \xj 



=xM^^{m-,m) + TjV^^{m-,m) - T\M\Nil (in) - TN^i{m) 
+ jN'{m^,m) - Y\M\N'{m). 

Which matches equationfTTlfrom Theorem 1. 
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5.2 Proof that the conjectured exact effective degree model is derivable from the Kolmogorov 
equations 

Letting m be an 5v.,-type motif from the effective degree model earlier and using Theorem 1, we find that the exact 
equations can be written as 

tM,i'{m-,m) + TA/'f - T|M|Arf (m) - TA/^^(m) 

+ YJ^'{m^,m) - Y\M\N'{m) 

T X (the total expected number of SI connections within 5s+i ,_i-type motifs 

where if infection occurs we obtain a 5s.,-type motif ) 
+ T X (the total expected number of SI connections where S lies within 
5^.s+i,;-i-type motifs and the 1 is external to the given motif 
and where, were an infection to occur, we obtain a 5i,,-type motif ) 

— tSsj X (number of SI connections within an individual 5i.,-type motifs) 

— T X (the total expected number of SI connections where S belongs to 
5s,, -type motifs and the I is external to the given motif ) 

+ 7 X (the total expected number I's within 5j_i,i+i-type and /v,,-type motifs 
where there recovery would give a 5j_,-type motif ) 

— jSs.i X (number of I within an individual Ss.rtype motif) 
T - xiS,j - T [ISS,,] + yh, + y{i + 1 )5,_i,,-+i - 7/5,,,- 

which is indeed the conjectured exact equation for Ss,i (similar derivation holds for Isj). To clarify the above derivation 
we note that a term such as zJVfJ {m^ ,m) will make no contribution to the resultant equation as there are no internal 
SI connections within 5i_i^;+i-type motifs along which an infection would lead to an 5j ,-type motif. However other 
terms, such as TjV^'{m~ ,m), have a direct correspondence with the resultant output (in this case the T[ISSs+i,i-i] 
term). 

6 Comparison of the closed models 

In comparing the models the obvious question to ask is when does one model perform better than another, i.e. which 
model approximates better or more accurately the simulation results or the solution of the Kolmogorov/master equa- 
tions where solvable. As discussed earlier, the pairwise model is known to perform well on networks that are well 
characterised by the average degree (i.e. regular random and Erdos-Renyi graphs). What is less known is under what 
circumstances do the heterogenous pairwise and effective degree models outperform one another 

To assess the performance of the three closed models we compared individual simulations to the solutions of the 
ODE's on four different types of undirected network. Firstly we use regular random networks where all nodes have the 
same number of randomly chosen neighbours. Secondly, on an Erdos-Renyi random network where the distribution 
of degrees converges to a Poisson distribution. Figure |3]plots simulation results against the different solutions of the 
ODEs for these two networks. On the regular network, whilst the two different pairwise models and the effective degree 
offer an improvement in performance over the standard meanfield equations, there is little to distinguish between the 
improved approaches. On the Erdos-Renyi random networks, the pairwise model improves on the meanfield model 
and, in turn, the effective degree and heterogeneous pairwise models improve even further on this. Again, however, 
there is little to distinguish between effective degree and the heterogeneous pairwise models. 

To investigate further we ran simulations on networks exhibiting greater heterogeneity. Firstly on a bimodal net- 
work generated by the configuration model algorithm HI 81 and secondly on assortative bimodal networks generated 
by rewiring the aforementioned bimodal networks according to the algorithm of Newman, 1 17 |. The results are illus- 
trated in Figure |4] Whilst on random bimodal network there is little difference between heterogeneous pairwise and 



dt 
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Figure 3: ODE performance on different networks Each network is of size ~ 500 and with disease parameters 
given by 7 = 1 and T = 0.5. Average prevalence was calculated from individual simulations on 100 different networks, 
(a) Regular random network, each node having degree 7. (b) Erdos-Renyi random network with average degree 7. 



18 



effective degree when assoitativity is added, there is a clear improvement in the performance of the heterogenous pair- 
wise model over the effective degree. This performance benefit must, however, be considered in terms of the model 
complexity given in table [3] (note in this table M is the maximum possible degree in the network and we given the 
minimum number of equations needed to implement the ODEs). 




-simulation average 

■ pairwise 

■ heterogeneous pairwise 

■ effective degree 



2.5 
Time 

(a) Bimodal 



3.5 



4.5 




(b) Assortative bimodal 



Figure 4: ODE performance on different networks Each network is of size A' = 500 and with disease parameters 
given by 7 = 1 and T — 0.5. Average prevalence was calculated from individual simulations on 100 different networks, 
(a) Bimodal configuration model with 207 nodes of degree 4 and 293 nodes of degree 10. (b) Assortative bimodal 
network, with same degree distribution as (c) but rewired to have assortativity coefficient r k, 0.56. 

A final comparison between the performance of the different closed models is to look at their rate of convergence 
to the solution of the Kolmogorov equations on a complete (fully connected) network. On a complete network it is 
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Table 3: Complexity of closed ODEs 



Model # equations complexity 

meanfield 1 

pairwise 3 ^(1) 

effective degree M(M + 3 ) - 1 O (M^ ) 

heterogeneous pairwise 2M(M+1)-1 0{M^) 

Kolmogorov equations 2^ 0{2^) 



possible (see fT\\) to reduce the full system of 2^ equations to just 1 equations. This allows us to compare the true 
solution to the approximate solution of the meanfield, pairwise (equivalent to heterogenous pairwise on a complete 
graph) and effective degree models. Interestingly we find that all three exhibit 0(1 /N) convergence, where although 
both pairwise and effective degree bring an improvement on meanfield, the difference between the convergence of the 
two is neglible and almost indecernible (see figure|5]l. 




Figure 5: Convergence to exact solution on a complete graph. Absolute difference between the exact steady state 
solution of the percentage of infected individuals and those calculated from three different ODE models for 10 different 
network sizes and initial prevalence of 40 percent. Black triangles represent meanfield, blue circles effective degree 
and red squares the pairwise equations. Linear lines of best fit are also shown. This shows that the error(N) appears to 
be of 0(1/N) as N tends to infinity. 



7 Discussion 

In this paper we set out to achieve a greater understanding of the relation between some of the more common 
approaches to modelling disease dynamics. In doing so we conjectured an exact version of the effective degree 
model 1 15 1 and showed how this model could be used to recover the pairwise model lITJl . We then extended this 
model to incorporate greater network structure and illustrated how, from this extension, we could then recover the het- 
erogeneous pairwise model [6 1. We then proved that the conjectured exact effective degree model was indeed exact by 
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proving that a heuristic derivation of an ODE model for an arbitrary motif was derivable directly from the Kolmogorov 
equations and noting that the exact effective degree model was just a particular case of this heuristic model. Finally we 
considered the performance of the different models on four different type of networks and have analysed numerically 
the rate of convergence to the lumped Kolmogorov equations on a complete network. These comparisons suggest a 
performance hierarchy of models as illustrated in Figure 0and it is worth noting that the performance benefit of the 
heterogenous pairwise model on networks exhibiting susceptible infectious — > removed (SIR) disease dynamics 
was also touched upon in 14|. 

Whilst we have shown how current models can be extended in a way that can capture more network topology, these 
extensions have a more theoretical rather than practical motivation as their added complexity makes them not only less 
tractable but also more resource intensive in their solving, thus making the use of simulations more of an attractive 
proposition. As the links between these models are better understood, future work will likely focus on the following 
three areas. Firstly, a more realistic network will have a more clique-like structure. For example an individual is 
likely a member of a household in which he has regular contacts within and less regular contacts outside. Being 
able to incorporate this household structure within epidemic models is thus important in understanding the outbreak 
and necessary curtailment of an infectious disease (see ^ |9] |22l). Secondly, a network of individuals is not well 
represented by a static network. An individual may have regular contact with few individuals but may create or break 
contacts with others in ways that a static network representation cannot capture. For this reason it is important to take 
into consideration not only the dynamics of the disease but also the dynamics of the network and how the two impact 
on one another (see |j7]|T2l). Thirdly, assuming we can write down exact differential equations we have to close them 
in some way. Understanding the performance of current, and also the derivation of new closures, is arguably the most 
important task ahead as it is the closures that limit the performance of any system of ODEs. 




Figure 6: Model performance hierarchy. Model performance hierarchy based on our observations. Here K represents 
the Kolmogorov equations and HetPW the heterogeneous pairwise equations. 
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Appendix 1 

Derivation of the pairwise equation from the exact effective degree model for singles and pairs are as follows, 

±[s] = ^s-j = r[i]-T[si], 

s,i 

where most terms from the original effective degree equations cancel and we have used that iSg^i = [SI] and 
J2s i^s,i = For the pairs the effective degree model yields, 

j,[ss]=T.^s-^ 

= - T ^ siSsj + yYI si,j + s{i+i )Ss-i,i+i - rXl 

= - x[iss] + Y[is] + - 1) (i + 1 + yJ2 + 1 

- y[ISS] + + l)[/55,+i,,_i] - [ISS,+i,i-i] - xY,4lSSs,i] 
= - T[ISS] + y[IS] + y[ISS] + Y[IS] - y[iss] - x[ISS] 

= -2t[ISS]+2y[IS], 

s,i 

+ T^.9[/5/,+ i,-_i] -tJ2s[ISI,j] 

=t[iss] - Y[is] + yJ2 - !)('•+ + rE + 

- Y[IIS] + (5 + - - T^5[/5/,,i] 

=x[iss] - y[is] + y[iis] + y[ii] - y[iis] - t{[ISI] + [IS]) 

=Ti[ISS]-[ISI]-[IS]) + Ym-[m), 

s,i 

=xJ2'^S,j - yY^ + yYI + - jY 

=T ^ /(/ - 1 )S.s.i + T ^ iSs,i - y[II] 

+ 7[m - yY. - 1 ^ '^M- 

+ {i - + [ISh+u-i] - T^/[/5/,,,-] 

=x[isi] + x[is] - y[ii] + y[iii] - y[iii] - y[ii] + ^[W/] + x[IS] 

=2x{[ISI] + [IS])-2y[II]. 
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Appendix 2 

Derivation of the heterogeneous pairwise equations from the effective degree with neighbourhood composition model 
with the neighbourhood composition model. For singles and pairs the following identities hold, 



^n= J2 s,,,=Y[n-T[s"i], 

\s'\ + \i'\=N 
\s'\+\i'\=N 



d 
dt 



S'S" 



\s'\+\i'\=n 



M 



k=l 



M 



k=l 



IS Sj ji 



k^l 



+^E^/E [j^\,a] +^E(^5+i) 



+ 7 



- T 



5'r 



+ 7 
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\s'\+\i'\=n 



M 



k-''k+ 



M 



+ T 



rE W - !)('•; + iXv;_,,v + rE ('/ + i)'^-v;_,'; 

E^:E[«V4j+^E(^;+i) 

^E[^^'^.^;„';J-^E^H^5^.v'/] 
[5'/" 



-r 



7|yr/j +7 

T 



77" 



5'7" 



T 75"5' - T 75'7" - T 5' 7" + 7 7' 7" - 7 5'7" , 



jl jn 



vl jn 



|s'|+|i'|=n 



k=\ 



E ^ll^'l^.',,-' - rE ''i^^'^' + I'E E ('^ + 

■tE4[^54v 



175*7 



-rE'/l''l^^'/ + ^E'/E^ 

TE'5(l''l-i)5y/ + TE'55,,,-7[/'/"] 
+ /[/'/"/] - rE'/ (l''l - i)'^.v'.'' - rE''^-^-'^'" 

E'/EK..c]+^E('/-0K„4- 
^E[^^\^.;J-^E'/[^5^.'/] 

27[7'7"] 



+ T 



=T [/5"7] 

+r 



7'7«7 



T 

-r 



7'5" 



jljnj 



+ T 



75'7" 



5^7" 



T 7'5"7 +T 7' 5" -27 7'7" +T 75'7'' +T S'P . 



/ jn 
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